An Interesting Fitting Formula of Quark Masses 
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Abstract 

We show an interesting empirical formula of quark masses here, which is found by 
implementing a least squares fit. In this formula the measured QCD coupling is 
almost a "best fitting coupling". 

There are many papers working on the masses of quarks, in theory or in experiment. 
For theory, the difficulty is, masses of quarks can not been determined by first principle. 
The masses of known six quarks correspond to six free parameters in standard model (SM). 
On the other hand, there appears a mystery problem in the SM, that is, comparing to t 
quarks, the other five quarks can be considered as ones with vanishing mass. However, there 
are empirical formulae to illuminate this hierarchy problem, such as reference pQ. We show 
another interesting mass fitting for the known six quarks in this note, which can be compared 
with the results in pQ. Although it is empirical, this formula is significant agreement with 
experiment values. We expect this formula can help us to discover the correct theory of 
flavor. A byproduct of this formula is, that the best fitting coupling is just the experiment 
one. 

We first list the MS quark masses in table 1. These data are from reference in different 
scale. Because pole masses of light quarks are physical meaningless and pole masses of heavy 
quarks also have evil definitions'"', we do not use the data of pole mass here. 





n g = 1 


n g = 2 


n 9 = 3 


n s = 1 


3.25 ± 1.75Mev(2Gev) 


1.2 ± 0.2Gev(m c ) 


174 ± 5Gev^ 


n s = 2 


7.0 ± 2.0Mev(2Gev) 


115 ± 35Mev(2Gev) 


4.25 ± 0.2Gei?(m 6 ) 



Table 1: Masses of quarks in MS scheme from j2|. n g is the number of generator and 
n s is the number of "isospin": n s = 3(1— \Q\), where Q is the charge of quark. Since 
reference [2] only gives mass ranges of different quarks, we take its midpoint as our 
input. For instance, the mass of u quark 3.25 ± 1.75Mev(2Gev) just corresponds to 
the range 1.5Mev < m u {2Gev) < 5Mev in reference |2j. The quantities in bracket 
are scales where we obtained quark masses. 

b This is the I^ole mass of top quark, which should be converted into MS mass 
using Eqn. (fTliGl. 



We then renormalize different quark masses to the mass of Z particle mz = 91.2Gev 
using renormaiization group equation. Relevant formulae have been shown in reference 0]. 
We take here a s (m z ) = 0.117 ^ , which corresponds A^ 5 ) = l97Mev, = 27AMev and 
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A® = 3WMev. The results have been shown in Table 2. As argued in Tab. 1, one should 
convert the pole mass of t quark into MS mass. Since rrit » m q , where q = u, d, s, c, b, 
the mass is converted by a simpler relation, 

m P° le = m(m)[l + + (13.44 - 1.04 x 5)(^^) 2 ]. (1) 

3tt 7T 





n g = 1 


n g = 2 


n g = 3 


n s = 1 


0.00019(10) 


0.597(100) 


173(5) 


n s = 2 


0.00041(12) 


0.067(21) 


2.94(17) 



Table 2: The MS masses (Gev) at scale mz- 



In QCD, the contributions of quark mass to high energy parameters, such as anomalous 
dimension functions, decay behaviors beyond tree level, coupling, and even more, the evolu- 
tions of masses themselves, have the form In—, where mn is some subtraction mass. Thus, 

' mo ' u ' 

we consider here the relations of the logarithm of quark mass, for instance, lnm/mo, where 
we choose mo = lGev . The results are listed in Table 3. 





n g = 1 


n g = 2 


n g = 3 


n s = 1 


-6.26(54) 


-0.515(167) 


5.155(29) 


n s = 2 


-5.49(29) 


-2.696(304) 


1.079(59) 



Table 3: y(n g ,n s ) = lnm/mo a t scale mz, where mo = lGev. 

We find that, for definite n s , there is an approximate linear relation among different n g . 
One can use formula ain s n g + a2"n g + a^n s + 04 to fit quark masses. We use the least squares 
method with wight to obtain aj. That is, find parameters to minimize function 

/= ^2 (y(n g ,n s ) - (ain s n g + a 2 n g + a 3 n s + a 4 )) 2 w(n s ,n g ), (2) 

n B =l,2,3,n s =l,2 

where w(n s ,n g ) is the wight: w(n s ,n g ) = dy~ 2 (n s ,n g ), where dy is the error of y = lnm/mo- 
The coefficients are a x = -2.31(20), a 2 = 7.99(32), a 3 = 2.85(58), and a 4 = -14.73(94). 
Notice that 

a\n s n g + a 2 n s + a 3 n g + a 4 = (ain g + a 3 )(n s + — ) + (a 4 - -^), (3) 

ai ai 

a more convenient approach is to redefine mo and then fit mass using (c\n g + C2){n s + c 3 ). 
From Eqn. OJwe let m satisfy In = a 4 - ^f- = -4.886, or m (m z ) = 7.55Mev. Data 
of lnm/mo have been shown in Table 4. 





n 9 = 1 


n 9 = 2 


n 9 = 3 


n s = 1 


-1.375(538) 


4.372(167) 


10.041(30) 


n s = 2 


-0.608(286) 


2.191(304) 


5.965(59) 



Table 4 : y(n g ,n s ) = lnm/mo at scale mz, where mo = 0.00755Get>. 
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Using the least squares method, that is, minimizing the function 



^2 (y(n g ,n s ) - (c 1 n g + c 2 )(n s + c 3 )) 2 w{n s ,n g ), (4) 
n 9 =l,2,3,ra s =l,2 

one obtains c\ = —2.311(25), c 2 = 2.848(28) and C3 = —3.459(35). In fact, one can use 
a more symmetrical form, c\{n g + c 2 )(n s + C3), where c' 2 = c 2 /c\, to fit y(n g ,n s ). Notice 
C3 ~ 3c2/ci, one can furthermore reduce parameters cl, c2, c3 into two parameters cl, c2, 
if he performs constraint C3 = 2>c 2 /c\ = 3c' 2 . We did not do it here. Quark masses and the 
fitting formula are plotted in Fig. 1. This empirical formula can be compared with the results 
introduced in ref. pQ. 

To discuss the quality of fitting in Eqn. Q, we study modified coefficient of determination 
for statistics: 

2 _ SSE/N E 
K _1 SST/Nt ' { ) 

In Eqn. © Ne = 6 — 3 — 1 = 2 is the degree of sum squared error (SSE), SSE = 
J2 (y(n g ,n s ) — y(n g ,n s )) 2 , where y is the fitting value of y, Nt = 6 — 1 — 1 = 4 is the 

n s ,n g 

degree of total sum of squares (SST), SST = J2 {y{ n gi n s) — y) 2 , where y is the average of 

n s ,n g 

V'- V = \ y( n gi n s)- The additional subtraction of 1 is due to subtraction Constance mo- 

n s ,n g 

In other literature R 2 is written as R 2 . 

Generally, < R 2 < 1. One obtains here that R 2 = 0.99584, which is very close to 1. 
This roughly means that, about 99.6 percent of the mass statistics can be interpreted by this 
empirical fitting. Or, the part which can not been interpreted by the fitting is no more than 
1%. We conclude that this fitting is a quite good empirical formula. 

All the calculations given above depend on the coupling a s (raz)- But, a s (mz) itself is 
determined by experiment and it also has error. Therefore, it is interesting to study the 
fitting behavior at different a s (mz)- Due to the experiment interesting, we vary a s {mz) 
from 0.09 to 0.13 here. We repeat all the calculations, taking Tab. 1 as input, and then 
study the behavior of modified coefficient of determination R 2 for the fitting formula ©, 
dSJ 1 . The results are shown in Fig 2. One just finds that R 2 approaches its maximum at the 
range 0.115 < a s (mz) < 0.12. We call this coupling as best fitting coupling a b , where the 
fitting formula @ works best. We use da b s = (da b ,/dm(n s ,n g )) 2 dm 2 (n s ,n g ), where 



m(n s ,n g ) is the data in Tab. 1 and da b s / 'dm(n s ,n g ) can be extracted by a small shifting of 
m(n s ,n g ), to estimate error of a b s: da b . The result is a b = 0.118(31). The big error is mainly 
due to the errors of u, d, s quarks masses. 

We see that the measured coupling is very close to the best fitting coupling. Here is a 
possible interpretation. As we know, the masses in Tab. 1 are mean ones, which are extracted 
by connecting various theories and experiments. In this sense, we say that the data in Tab. 
1 is unprejudiced estimation of the true masses of quarks, as long as we have performed 
enough estimations. Suppose QCD and renormalization theory are both right theories and 
remember that the mean measured coupling a s {mz) = 0.117 is just the estimation of the 
true coupling a*. We conclude that, if fitting equation is a right or an approximate right 
behavior, a b should also be a estimation to a* , although a* itself depends on the level of loops 
calculations. On one hand, if the behavior of y(n g , n s ) is complete random or is not linear at 

1 Here, for simplification, we just make an assumption that the data in Tab. 1 is irrelvant with a s (mz). 
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all, or in other words, the fitting (j3J) is not a correct one, one should obtain two bad results, 
one is that R 2 is not so close to 1, the other is that, generally, a s ^ a l s , unless by chance. 
Since R 2 is very close to unitary, we expect the equation (|3J) is a right or an approximate 
right behavior of quarks. On the other hand, when one says that the fitting equation @ is a 
right or an approximate right behavior, he always implies that this statement is obtained at 
correct coupling, a*. This means that, when the coupling deviates away a*, the fitting will 
go to bad. Or, in other words, the a* should be equal to the best coupling, a b s , provided one 
takes correct mass input in Tab. 1. Therefore, a b is also an estimation of a*. It is understood 
that a b s ~ a s . 

From Fig. 1 and R 2 check, the linear fitting is quite good agreement with the experiment 
data, s quark lies a little below the fitting line, (or on the contrary, d quark lies a little 
above the fitting line,) which may be considered as the correction due to QED and statistic 
error. In fact, if one uses linear fitting to fit the masses of leptons, which have \Q\ = 1, the 
experiment of [i lepton should lies above the fitting line( Since the lepton does not enjoy 
strong interaction, we do not discuss the fitting for lepton masses in detail here). 

If the logarithm of quark masses does have linear behavior, there are some interesting 
infers immediately. 

For instance, one can use this formula to extract masses of four generator quarks. For the 
heavier quark t' , we get m%i{mz) = 51(17)Tev, which is beyond our experiment capability. 
But the searching of lighter quark b' is not beyond our experiment capability. In fact, using 
extraction of the linear fitting, we obtain my{mz) = 85(23)Gev, which corresponds to the 
pole mass mj," 16 = 91(25)Gev. In other words, my le ~ mz dramatically. 

It is also a puzzle that whether the fourth-generate quarks exist or not. Until nowadays, 
we did not find the fourth- generate quarks. But since m^f e ~ mz, one should check the data 
at the vicinity around mz more carefully. According to reference [3], the mass gap between 
the fourth-generate quarks is so large that possibly there is no fourth-generate quark at all. 

It is hard to understand why top quark has so large mass in SM. In some seesaw mecha- 
nism, for instance, the source of top quark is significant different with that of other quarks ^ 
and therefore the formula of top quark is also different with that of other quarks. But we 
see here that the mass of top quark extracted from the linear behavior of the logarithm of 
quark mass agrees well with the measured ones. This implies that the top quark is also a 
" common" quark and the masses of top quark and all the other known quarks share the same 
source, although m t » mz- We expect this is helpful to understand the source of particle 
masses and correct theory of flavor. 

In some references, the vanishing of up quark mass is used to solve strong CP puzzle in 
QCD. However, if the fitting equation (JIJ is right or approximate right, the mass of up quark 
is never vanishing. This means that the solution of CP broken in QCD is not the vanishing 
of up quark mass. It is possibly from other mechanism, for instance, U(l) symmetry. 

At last, the subtract mass is mo(mz) = 7.55Mev, which corresponds mo (mo) = 428Mev, 
or roughly equals to constituent light quarks mass. 

In summary, the significant agreement of equation (|1J shows that the masses of known 
quarks are never random. Therefore, equation should be included in the full theory. We 
expect this equation should give some clue of full theory, such as the source of masses or 
flavor physics. 

The author D.Q. Liu is very grateful to Prof. J. Ferrandis for reading the manuscript and 
helpful comments. 
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Figure 1: ln(-^-) vs. n g , where n g stands for the generator of quarks while mo is pointed 
out in Tab. 4. The dashing line is ln(^-). 
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Figure 2: R 2 vs. a s {mz)- 
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